A New Approach to Segmentation of Persian Cursive Script based on Adjustment the Fragments
Abstract
Optical Character Recognition (OCR) is a long-standing problem of great interest in the field of pattern recognition. Recognizing cursive scripts such as Persian and Arabic is difficult, as their segmentation suffers from serious problems across different languages. Segmentation is the process of dividing cursive words into smaller parts in order to reduce the complexity and increase the accuracy of the recognition process. This paper presents an improved segmentation method for Persian script; to improve segmentation quality, structural features of the Persian language are used to adjust the fragments. The method is robust and flexible, and it increases the system's tolerance to font variations. The proposed method segments existing Persian fonts with up to 99.2% accuracy.
Similar resources
Robust Optical Recognition of Cursive Pashto Script Using Scale, Rotation and Location Invariant Approach
The presence of a large number of unique shapes called ligatures in cursive languages, along with variations due to scaling, orientation and location provides one of the most challenging pattern recognition problems. Recognition of the large number of ligatures is often a complicated task in oriental languages such as Pashto, Urdu, Persian and Arabic. Research on cursive script recognition ofte...
A New Segmentation Algorithm for Online Handwritten Word Recognition in Persian Script
The cursive nature of the Persian alphabet, and the complex and convoluted rules of this script, pose major challenges for the segmentation as well as the recognition of Persian words. We propose a new segmentation algorithm for the main stroke of online Persian handwritten words. Using this segmentation, we present a perturbation method which is used to generate artificial samples from handwritten w...
Segmentation of Persian Cursive Words Using Basic Shapes
Segmentation is the process of dividing cursive words into smaller parts in order to reduce the complexity and increase the accuracy of the handwriting recognition process. However, it is a complicated and time-consuming task. In this paper, we introduce the concept of basic shapes and explore its application to the segmentation of Persian words. Considering a set of pre-defined shapes include line and open or...
A Novel Approach to Persian Online Hand Writing Recognition
Persian (Farsi) script is fully cursive, and each character is written in several different forms depending on the characters that precede and follow it in the word. These complexities make automatic handwriting recognition of Persian a very hard problem, and there are few contributions that try to solve it. This paper presents a novel practical approach to online recognition of Persian handwriting whic...
A Robust Free Size OCR for Omni-Font Persian/Arabic Printed Document Using Combined MLP/SVM
Optical character recognition of cursive scripts presents a number of challenging problems in both the segmentation and recognition processes, and this attracts much research in the field of machine learning. This paper presents a novel approach based on a combination of MLP and SVM to design a trainable OCR for Persian/Arabic cursive documents. The implementation results on a comprehensive databas...